CDS

Accession Number TCMCG064C25324
gbkey CDS
Protein Id XP_020553210.1
Location complement(join(12121473..12121589,12122027..12122119,12123754..12123874,12126170..12126312,12126943..12127024,12127157..12127368,12128024..12128151,12128251..12128380))
Gene LOC105172372
GeneID 105172372
Organism Sesamum indicum

Protein

Length 341aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268358
db_source XM_020697551.1
Definition putative protease Do-like 14 isoform X3 [Sesamum indicum]

EGGNOG-MAPPER Annotation

COG_category O
Description Trypsin-like serine protease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
ko03110        [VIEW IN KEGG]
KEGG_ko ko:K08669        [VIEW IN KEGG]
ko:K08784        [VIEW IN KEGG]
EC 3.4.21.108        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04210        [VIEW IN KEGG]
ko04214        [VIEW IN KEGG]
ko04215        [VIEW IN KEGG]
ko05012        [VIEW IN KEGG]
map04210        [VIEW IN KEGG]
map04214        [VIEW IN KEGG]
map04215        [VIEW IN KEGG]
map05012        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAGTAAAGGATATGGACTGGATGCTGGAGATAGTCCCAAACATTCGTGCAGCTGTCTTGGCCGTGATACAATTGCAAATGCAGCGGCTAAGGTTGGTCCTGCTGTTGTCAATTTGTCAGTGCCACAAAGTTTTCATGGTATGACTGTGGGTAAAAGCATTGGATCAGGAACCATTATAGATGAGGATGGTACTATCTTGACTTGTGCTCATGTGGTGGTTGATTTTCAAGGCTTGAGGTCTTCATCTAAGGGAAAGGTTGAAGTGACTTTACAGGATGGTCGGTCATTTGAGGGCACAGTGGTGAATGCTGATCTACATTCTGATATAGCAATAGTTAGGATCAAATCTAAAACTCCACTTCCAACAGCAAAACTTGGGAGCTCAAGTAAGCTTCGGCCTGGCGATTGGGTGGTGGCTATGGGCTGTCCCCTTACCCTCCAGAATACCATCACAGCAGGTATTGTAAGTTGTGTTGACCGTAAAAGTAGTGACTTGGGTCTTGGAGGAATGCAAAGGGAGTATTTGCAGACTGATTGTGCAATAAATCAGGGTAATTCTGGTGGACCGCTTGTCAATGTTGATGGAGAAGTTGTAGGTGTAAATATAATGAAAGTGTTGGGGGCTGATGGGTTAAATTTCGCCGTCCCGATTGATTCTGTTTCAAAAATAGTAGAGCACTTCAAAAAGAATGGGAGAGTTGTCCGACCTTGGCTTGGTTTGAAAATGCTTGATCTCAATGACATGATTGTTGCACATCTAAAGGAAAGAAATGCTTCCTTTCCAGATGTCAGCAGAGGAGTTCTTATACCTATGGTATCACCAGGTTCCCCAGCTGATCGTGCTGGATTTCGTCCCGGAGATGTTGTAGTTGAATTTGGTGGGAGGCCTATTGGAAGTATTAAGGAGGTGATCGATATTATGGGGGATAAAATCGGAAAGCCTTTCAAGGCTGTAGTGAAAAGGGCAAACAACATAACTGTGAATTTGACTGTCATTCCTGAAGAAGCAAATCCAGATATGTGA
Protein:  
MSKGYGLDAGDSPKHSCSCLGRDTIANAAAKVGPAVVNLSVPQSFHGMTVGKSIGSGTIIDEDGTILTCAHVVVDFQGLRSSSKGKVEVTLQDGRSFEGTVVNADLHSDIAIVRIKSKTPLPTAKLGSSSKLRPGDWVVAMGCPLTLQNTITAGIVSCVDRKSSDLGLGGMQREYLQTDCAINQGNSGGPLVNVDGEVVGVNIMKVLGADGLNFAVPIDSVSKIVEHFKKNGRVVRPWLGLKMLDLNDMIVAHLKERNASFPDVSRGVLIPMVSPGSPADRAGFRPGDVVVEFGGRPIGSIKEVIDIMGDKIGKPFKAVVKRANNITVNLTVIPEEANPDM